NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Spatial Audio Processing with Large Language Model on Wearable Devices

Mishra, Ayushi; Bai, Yang; Narayanasamy, Priyadarshan; Garg, Nakul; Roy, Nirupam (July 2025, International Conference on Machine Learning (ICML))

Integrating spatial context into large language models (LLMs) has the potential to revolutionize human-computer interaction, particularly in wearable devices. In this work, we present a novel system architecture that incorporates spatial speech understanding into LLMs, enabling contextually aware and adaptive applications for wearable technologies. Our approach leverages microstructure-based spatial sensing to extract precise Direction of Arrival (DoA) information using a monaural microphone. To address the lack of existing dataset for microstructure-assisted speech recordings, we synthetically create a dataset called OmniTalk by using the LibriSpeech dataset. This spatial information is fused with linguistic embeddings from OpenAI’s Whisper model, allowing each modality to learn complementary contextual representations. The fused embeddings are aligned with the input space of LLaMA-3.2 3B model and fine-tuned with lightweight adaptation technique LoRA to optimize for on-device processing.
more » « less
Full Text Available
Parametric study of “filament and gap” models of resistive switching in TaOx-based devices.

https://doi.org/10.1063/5.0246985

Li, Rongchen; Bai, Yang; Skowronski, Marek (March 2025, Journal of Applied Physics)
Greer, Julia R (Ed.)
A finite element model consisting of a conducting filament with or without a gap was used to reproduce the behavior of TaOx-based resistive switching devices. The specific goal was to explore the range of possible filament parameters such as filament diameter, composition, gap width, and composition to reproduce the conductance and shape of I–V while keeping the maximum temperature within the acceptable range allowing for ion motion and preventing melting. The model solving heat and charge transport produced a good agreement with experimental data for the oxygen content in the filament below TaO1.3, the filament diameter range between 6 and 22 nm, and the gap oxygen content between TaO1.7 and TaO1.85. Gap width was not limited to either low or high sides according to the criteria considered in this report. The obtained filament composition corresponds to oxygen deficiency an order of magnitude higher than one estimated by other modeling efforts. This was in large part due to the use of recent experimental values of conductivity as a function of composition and temperature. Our modeling results imply that a large fraction of atoms leaves and/or accumulates within the filament to produce a large relative concentration change. This, in turn, necessitates the inclusion of strain energy in the filament formation modeling. In addition, the results reproduce non-linear I–V without the necessity of assuming the Poole–Frenkel type of electrical conduction or the presence of a barrier at the oxide/metal interface.
more » « less
Full Text Available
Synthesis of superconducting freestanding infinite-layer nickelate heterostructures on the millimetre scale

https://doi.org/10.1038/s44160-024-00714-2

Lee, Yonghun; Wei, Xin; Yu, Yijun; Bhatt, Lopa; Lee, Kyuho; Goodge, Berit H; Harvey, Shannon P; Wang, Bai Yang; Muller, David A; Kourkoutis, Lena F; et al (May 2025, Nature Synthesis)

Full Text Available
Signatures of ambient pressure superconductivity in thin film La3Ni2O7

https://doi.org/10.1038/s41586-024-08525-3

Ko, Eun Kyo; Yu, Yijun; Liu, Yidi; Bhatt, Lopa; Li, Jiarui; Thampy, Vivek; Kuo, Cheng-Tai; Wang, Bai Yang; Lee, Yonghun; Lee, Kyuho; et al (February 2025, Nature)

Full Text Available
Scribe: Simultaneous Voice and Handwriting Interface

https://doi.org/10.1145/3631411

Bai, Yang; Shahid, Irtaza; Takawale, Harshvardhan; Roy, Nirupam (December 2023, Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies)

This paper presents the design and implementation of Scribe, a comprehensive voice processing and handwriting interface for voice assistants. Distinct from prior works, Scribe is a precise tracking interface that can co-exist with the voice interface on low sampling rate voice assistants. Scribe can be used for 3D free-form drawing, writing, and motion tracking for gaming. Taking handwriting as a specific application, it can also capture natural strokes and the individualized style of writing while occupying only a single frequency. The core technique includes an accurate acoustic ranging method called Cross Frequency Continuous Wave (CFCW) sonar, enabling voice assistants to use ultrasound as a ranging signal while using the regular microphone system of voice assistants as a receiver. We also design a new optimization algorithm that only requires a single frequency for time difference of arrival. Scribe prototype achieves 73 μm of median error for 1D ranging and 1.4 mm of median error in 3D tracking of an acoustic beacon using the microphone array used in voice assistants. Our implementation of an in-air handwriting interface achieves 94.1% accuracy with automatic handwriting-to-text software, similar to writing on paper (96.6%). At the same time, the error rate of voice-based user authentication only increases from 6.26% to 8.28%.
more » « less
Full Text Available
Spatiotemporal dynamics of microbial communities and cyanobacteria blooms in two North American Lakes using long-read 16S rRNA sequencing

https://doi.org/10.1016/j.ecolind.2024.111738

Castro Berman, Manuel; Hrycik, Allison R.; Costello, Angelica; Bai, Yang; Rose, Kevin C.; Relyea, Rick; Dordick, Jonathan S. (February 2024, Ecological Indicators)

Full Text Available
NeuE: Automated Neural Network Ensembles for Edge Intelligence

https://doi.org/10.1109/TETC.2022.3214931

Bai, Yang; Chen, Lixing; Xu, Jie (April 2023, IEEE Transactions on Emerging Topics in Computing)

Full Text Available
Automated Customization of On-Device Inference for Quality-of-Experience Enhancement

https://doi.org/10.1109/TC.2022.3208207

Bai, Yang; Chen, Lixing; Ren, Shaolei; Xu, Jie (May 2023, IEEE Transactions on Computers)

Full Text Available
Interference, diffraction, and diode effects in superconducting array based on bismuth antimony telluride topological insulator

https://doi.org/10.1038/s42005-023-01288-9

Song, Xiangyu; Suresh Babu, Soorya; Bai, Yang; Golubev, Dmitry S.; Burkova, Irina; Romanov, Alexander; Ilin, Eduard; Eckstein, James N.; Bezryadin, Alexey (December 2023, Communications Physics)

Abstract It is well-known in optics that the spectroscopic resolution of a diffraction grating is much better compared to an interference device having just two slits, as in Young’s famous double-slit experiment. On the other hand, it is well known that a classical superconducting quantum interference device (SQUID) is analogous to the optical double-slit experiment. Here we report experiments and present a model describing a superconducting analogue to the diffraction grating, namely an array of superconducting islands positioned on a topological insulator film Bi_0.8Sb_1.2Te₃. In the limit of an extremely weak field, of the order of one vortex per the entire array, such devices exhibit a critical current peak that is much sharper than the analogous peak of an ordinary SQUID. Therefore, such arrays can be used as sensitive absolute magnetic field sensors. A key finding is that the device acts as a superconducting diode, controlled by magnetic field.
more » « less
Full Text Available
Novel insights into construct toxicity, strain optimization, and primary sequence design for producing recombinant silk fibroin and elastin-like peptide in E. coli

https://doi.org/10.1016/j.mec.2023.e00219

Connor, Alexander; Wigham, Caleb; Bai, Yang; Rai, Manish; Nassif, Sebastian; Koffas, Mattheos; Zha, R Helen (June 2023, Metabolic Engineering Communications)

Full Text Available

« Prev Next »

Search for: All records